Chapter 8 


Virtual Memory 


Seventh Edition 
William Stallings 


You're gonna need a bigger boat. 


— Steven Spielberg, 
JAWS, 1975 


и Гу/о characteristics fundamental to memory 
management: 


1) all memory references are logical addresses that are 
dynamically translated into physical addresses at run time 


2) aprocess may be broken up into a number of pieces that 
don't need to be contiguously located in main memory 
during execution 


ш If these two characteristics are present, it 1$ not 
necessary that all of the pages or segments of a 
process be in main memory during execution 


Тег 


Virtual memory | А storage allocation scheme in which secondary memory can be 
addressed as though it were part of main memory. The addresses a 
program may use to reference memory are distinguished from the 
addresses the memory system uses to identify physical storage sites, 
and program-generated addresses are translated automatically to the 
corresponding machine addresses.The size of virtual storage is 
limited by the addressing scheme of the computer system and by the 
amount of secondary memory available and not by the actual 
number of main storage locations. 


Virtual address The address assigned to a location in virtual memory to allow that 
location to be accessed as though it were part of main memory. 


The virtual storage assigned to a process. 


The range of memory addresses available to a process. 


The address of a storage location in main memory. 


m Operating system brings into main memory a few pieces of the 
program 


m Resident set - portion of process that is in main memory 


m An interrupt is generated when an address is needed that is not 
in main memory 


m Operating system places the process 
in a blocking state 


Continued ... 


m Piece of process that contains the logical address 1s brought into 
main memory 


m operating system issues a disk I/O Read request 


m another process is dispatched to run while the disk I/O takes 
place 


m aninterrupt is issued when disk I/O is complete, which causes 
the operating system to place the affected process in the Ready 
state 


m More processes may be maintained in main memory 
m only load in some of the pieces of each process 


m with so many processes in main memory, it is very likely a 
process will be in the Ready state at any particular time 


ш А process may be larger than all of main memory 


Real and Virtual Memory 


* main memory, the actual RAM 


Virtual memory 


* memory on disk 


* allows for effective multiprogramming and relieves the 
user of tight constraints of main memory 


Table 8.2 


Characteristics of 
Paging and 


Segmentation 


Main memory 
partitioned into small 


fixed-size chunks called 


frames 


Program broken into 


pages by the compiler or 


memory management 
system 


Internal 
fragmentation 
within frames 


| | 
Simple Paging Virtual Memory Simple Segmentation Virtual Memory 
Paging Segmentation 


Main memory 
partitioned into small 


fixed-size chunks called 


frames 


Program broken into 


pages by the compiler or 


memory management 
system 


Internal fragmentation 
within frames 


Main memory not 
partitioned 


Main memory not 
partitioned 


Program segments 
specified by the 
programmer to the 
compiler (i.e., the 
decision is made by the 
programmer) 


Program segments 
specified by the 
programmer to the 
compiler (i.e., the 
decision is made by the 
programmer) 


No internal 
fragmentation 


No internal 
fragmentation 


No external No external External fragmentation | External fragmentation 
fragmentation fragmentation 


Operating system must 
maintain a page table 
for each process 
showing which frame 
each page occupies 


Operating system must 
maintain a free frame 
list 

Processor uses page 
number, offset to 
calculate absolute 
address 


АП the pages of a 
process must be in main 
memory for process to 
run, unless overlays are 
used 


Operating system must 
maintain a page table 
for each process 
showing which frame 
each page occupies 


Operating system must 
maintain a free frame 
list 


Processor uses page 
number, offset to 
calculate absolute 
address 


Not all pages of a 
process need be in main 
memory frames for the 
process to run. Pages 
may be read in as 
needed 


Reading a page into 
main memory may 
require writing a page 
out to disk 


Operating system must 
maintain a segment 
table for each process 
showing the load 
address and length of 
each segment 


Operating system must 
maintain a segment 
table for each process 
showing the load 
address and length of 
each segment 


Operating system must 
maintain a list of free 
holes in main memory 


Operating system must 
maintain a list of free 
holes in main memory 


Processor uses segment 
number, offset to 
calculate absolute 
address 


Processor uses segment 
number, offset to 
calculate absolute 
address 


АП the segments of a 
process must be in main 
memory for process to 
run, unless overlays are 
used 


Not all segments of a 
process need be in main 
memory for the process 
to run. Segments may 
be read in as needed 


Reading a segment into 
main memory may 
require writing one or 
more segments out to 
disk 


A state in which 
the system spends 


most of its time 
swapping process 
pieces rather than 
executing 
instructions 


Thrashing 


To avoid this, the 
operating system tries 
to guess, based on 
recent history, which 
pieces are least likely 
to be used in the near 
future 


m Program and data references within a process tend to cluster 


m Only a few pieces of a process will be needed over a short 
period of time 


m Therefore it is possible to make intelligent guesses about which 
pieces will be needed in the future 


m Avoids thrashing 


Paging Behavior 


E. o nun F 
ү ШТ | E! m During the lifetime of the 


| IA , А РЕА: г process, references are 
confined to a subset of pages 


Figure 8.1 Paging Behavior 


For virtual memory to be practical and 
effective: 


* hardware must support paging and 
segmentation 

* operating system must include software for 
managing the movement of pages and/or 
segments between secondary memory and 
main memory 


m The term virtual memory 1s usually associated with systems that 
employ paging 


m Use of paging to achieve virtual memory was first reported for 
the Atlas computer 


m Each process has its own page table 


m each page table entry contains the frame number of the 
corresponding page in main memory 


Virtual Address 


os Memory 


a АТСАН Management 


(a) Paging only Formats 


Virtual Address 


ЕТЕ И [ — Offset | 


Segment Table Entry 


PMDerCenzolBis Length | Segment Base 


(b) Segmentation only 


Virtual Address 


Segment Table Entry 


Page Table Entry 


Бран conoi ва Frame Number | P= present bit 
M = Modified bit 


(c) Combined segmentation and paging 


Address Translation 


Virtual Address Physical Address 


Register 


Paging Mechanism Main Memory 


Figure 8.3 Address Translation in a Paging System 


Two-Level 
Hierarchical Page Table 


4-kbyte root ТТ 


M 


Ни, cm 


Figure 8.4 A Two-Level Hierarchical Page Table 


4-Gbyte user 
address space 


ddre 


Address Translation 


4-kbyte page 
Root page table table (contains 
(contains 1024 PTEs) 1024 PTEs) 


Program Paging Mechanism Main Memory 


Figure 8.5 Address Translation in a Two-Level Paging System 


m Page number portion of a virtual address is mapped into a hash 
value 


m hash value points to inverted page table 


m Fixed proportion of real memory is required for the tables 
regardless of the number of processes or virtual pages supported 


m Structure is called znverted because 1t 1ndexes page table entries by 
frame number rather than by virtual page number 


Inverted Page Table 


| Virtual Address 
n bits 


Control 
bits 


m bits 


т bits 
Inverted Page Table Real Address 


(one entry for each 
physical memory frame) 


Figure 8.6 Inverted Page Table Structure 


Each entry ш the page table includes: 


Page Process Chain 
number identifier pointer 


* the process * includes * the index 
that owns flags and value of the 
this page protection next entry 


and locking inthe chain 
information 


Inverted Page Table 


m Rather than each process having a page table and keeping track 
of all possible logical pages, track all physical pages 


One entry for each real page of memory 


Entry consists of the virtual address of the page stored in that 
real memory location, with information about the process that 
owns that page 


ш Decreases memory needed to store each page table, but 
increases time needed to search the table when a page 
reference occurs 


B Use hash table to limit the search to one — or at most a few — 
page-table entries 


e TLB can accelerate access 
m But how to implement shared memory? 


e One mapping of a virtual address to the shared physical 
address 


Ы 


m Each virtual memory 
reference can cause two 


physical memory accesses: 


m one to fetch the page 
table entry 


m one to fetch the data 


m To overcome the effect of 
doubling the memory 
access time, most virtual 
memory schemes make 
use of a special high-speed 
cache called a translation 
lookaside buffer 


Use of a TLB 


Secondary 


Main Memory Memory 


Virtual Address 


Translation 
Lookaside Buffer 


Real Address 


Page fault 


Figure 8.7 Use of a Translation Lookaside Buffer 


Operation 
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m The ТІВ only contains some of the page table entries so we 
cannot simply index into ће ТІВ based on page number 


m each TLB entry must include the page number as well as the 
complete page table entry 


m The processor is equipped with hardware that allows it to 
interrogate simultaneously a number of TLB entries to 
determine if there is a match on page number 


Direct Versus 
Associative Lookup 


Virtual Address Virtual Address 
Page # Offset Page # Offset 


Page# PT Entries 
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Frame # Offset 
Real Address 


Translation Lookaside Buffer 


Page Table 


(a) Direct mapping (b) Associative mapping 


Figure 8.9 Direct Versus Associative Lookup for Page Table Entries 


Frame # Offset 


Real Address 


TLB and Cache Operation 


TLB Operation 
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LI 
1 Virtual Address 


Cache Operation 


“----------....................., 


Real Address 


Page Table 


Figure 8.10 Translation Lookaside Buffer and Cache Operation 


m The smaller the page size, the lesser the amount of internal 
fragmentation 


m however, more pages are required per process 
m more pages per process means larger page tables 


m for large programs in a heavily multiprogrammed environment 
some portion of the page tables of active processes must be in 
virtual memory instead of main memory 


m the physical characteristics of most secondary-memory devices 
favor a larger page size for more efficient block transfer of data 


_ Paging Behavior of a Program | 
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(b) Number of Page Frames Allocated 


N = total number of pages in process 


Figure 8.11 Typical Paging Behavior of a Program 


Example: Page Sizes 


Computer 
Atlas 
Honeywell-Multics 
IBM 370/ХА and 370/ESA 
VAX family 
IBM AS/400 
DEC Alpha 
MIPS 
UltraSPARC 
Pentium 
IBM POWER 


Itanium 


Pase Size 
512 48-bit words 
1024 36-bit words 
4 Kbytes 
512 bytes 
512 bytes 
8 Kbytes 
4 Kbytes to 16 Mbytes 
8 Kbytes to 4 Mbytes 
4 Kbytes or 4 Mbytes 
4 Kbytes 
4 Kbytes to 256 Mbytes 


Page Size. 


The design issue of main memory is 
page size is related to getting larger and 


the size of physical | address space used by 
main memory and applications is also 
program size growing 


m Contemporary programming 
techniques used in large 
programs tend to decrease the 


most obvious on 
и personal computers 
locality of references within a where applications are 
process becoming increasingly 
complex 


m segmentation 
allows the 
programmer to 
view memory as 
consisting of 
multiple address 
spaces or 
segments 


Advantages: 


simplifies handling 
of growing data 
structures 

allows programs to 
be altered and 
recompiled 
independently 
lends itself to 
sharing data 
among processes 


lends itself to 
protection 


m Each segment table entry contains the starting address of the 
corresponding segment in main memory and the length of the 
segment 


ш А bit is needed to determine if the segment is already 1n main 
memory 


m Another bit is needed to determine if the segment has been 
modified since it was loaded in main memory 


Address Translation 


Virtual Address 


T 


Segment Table 


© 


Segment Table 


Segment 


Program Segmentation Mechanism Main Memory 


Figure 8.12 Address Translation in a Segmentation System 


Segmentation 


; 
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In a combined 
paging/segmentation system Segmentation is visible to the 
a user's address space is programmer 
broken up into a number of 
segments. Each segment 1s 


broken up into a number of 
fixed-sized pages which are Paging is transparent to the 
equal in length to a main programmer 
memory frame 


Address Translation 


Virtual Address 


Seg Table Ptr 


Segment 
Table 


Offset Page 
Frame 


Program Segmentation 


Mechanism 


Paging 
Mechanism 


Main Memory 


Figure 8.13 Address Translation in a Segmentation/Paging System 


Combined Segmentation 
and Paging 


Virtual Address 


[Segment Number [ — Pase Number | Оба” 


Segment Table Entry 


Page Table Entry 


РМОФет Control Bits] | Frame Number P- present bit 
M - Modified bit 


(c) Combined segmentation and paging 


m Segmentation lends itself to the implementation of protection 
and sharing policies 


m Each entry has a base address and length so inadvertent memory 
access can be controlled 


m Sharing can be achieved by segments referencing multiple 
processes 


Protection | 


Relationships 


Address Мат Memory 


Branch instruction 
(not allowed) 


Reference to 
data (allowed) 


Process C 


==... Reference to 
data (not allowed) 


Figure 8.14 Protection Relationships Between Segments 


End-to-end Core i7 Address Translation 


siis e L2, 13, and 
Virtual address (VA) main memory 

12 
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Core i7 Level 1-3 Page Table Entries 


63 62 52 51 12 


11 9 8 7 6 5 4 3 2 1 O 
СКИТА 
Available for OS (page table location on disk) P20 


Each entry references a 4K child page table 

P: Child page table present in physical memory (1) or not (0). 

R/W: Read-only or read-write access access permission for all reachable pages. 
U/S: user or supervisor (kernel) mode access permission for all reachable pages. 
WT: Write-through or write-back cache policy for the child page table. 

CD: Caching disabled or enabled for the child page table. 

A: Reference bit (set by MMU on reads and writes, cleared by software). 

PS: Page size either 4 KB or 4 MB (defined for Level 1 PTEs only). 

G: Global page (don’t evict from TLB on task switch) 


Page table physical base address: 40 most significant bits of physical page table 
address (forces page tables to be 4KB aligned) 


Carnegie Mellon 


Core i7 Level 4 Page Table Entries 


63 62 52 51 


12 11 9 8 7 6 5 4 3 2 1 0 
m mes раене вен | оше [6 | [о [соты 
Available for OS (page location on disk) P20 


Each entry references a 4K child page 

P: Child page is present in memory (1) or not (0) 

R/W: Read-only or read-write access permission for child page 

U/S: User or supervisor mode access 

WT: Write-through or write-back cache policy for this page 

CD: Cache disabled (1) or enabled (0) 

A: Reference bit (set by MMU on reads and writes, cleared by software) 
D: Dirty bit (set by MMU on writes, cleared by software) 

G: Global page (don’t evict from TLB on task switch) 


Page physical base address: 40 most significant bits of physical page address 
(forces pages to be 4KB aligned) 


Carnegie Mellon 


Core i7 Page Table Translation 


Virtual 


9 9 9 9 12 
address 


L1 PT L2 PT L3 PT L4 PT 
Page global Page upper Page middle Page 
40 directory до directory ao directory 40 table 
CR3 
Physical 
address Offset into 
of L1 PT 12 physical and 
L1 PTE L4 PTE virtual page 
Physical 
address 
512 GB 1GB 2 МВ 4 КВ of page 
region region region region 
per entry per entry per entry per entry 


40 


pi b 
address 
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Virtual Memory of a Linux Process 


Process-specific data 
structs (ptables, 


Different for task and mm structs, 
each process Kernel 
kernel stack) ) 
virtual 
Е 4 memor 
Identical for Physical memory y 
each process Kernel code and data 
E 

Process 
virtual 


Runtime heap (malloc) memory 


Uninitialized data (.bss) 
Initialized data (.data) 
0x08048000 (32) Program text (.text) 


Memory mapped region 
for shared libraries 


Carnegie Mellon 


Linux Organizes VM as Collection of "Areas" 


Process virtual memory 
vm area struct 


task struct 


Shared libraries 


m pgd: 
" Page global directory address 
=" Points to L1 page table 


m vm prot: 
" Read/write permissions for 
this area 
m vm flags 


" Pages shared with other 
processes or private to this 
process 
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Linux Page Fault Handling 


Process virtual memory 


shared libraries 


vm area struct 


Segmentation fault: 
accessing a non-existing page 


Normal page fault 


Protection exception: 

e.g., violating permission by 
writing to a read-only page (Linux 
reports as Segmentation fault) 
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Memory Mapping 


m VM areas initialized by associating them with disk objects. 


" Process is known as memory mapping. 


m Area can be backed by (i.e., get its initial values from) : 
=" Regular file on disk (e.g., an executable object file) 
» Initial page bytes come from a section of a file 
" Anonymous file (e.g., nothing) 
= First fault will allocate a physical page full of O's (demand-zero page) 
" Once the page is written to (dirtied), it is like any other page 


m Dirty pages are copied back and forth between memory and a 
special swap file. 
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Demand paging 
m Key point: no virtual pages are copied into physical 


memory until they are referenced! 
= Known as demand paging 


m Crucial for time and space efficiency 
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Sharing Revisited: Shared Objects 


Bii e Physical 2. m Process 1 maps 
virtual memo memo virtual memo 
ШЫ Ы i the shared 
object. 
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object 
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Sharing Revisited: Shared Objects 


| Process 1 Physical | dinis 2 m Process 2 maps 
virtual memory js virtual memory the shared 

P EN object. 

C NEN ы. m Notice how the 

eX 2 virtual 

NN E addresses can 

А! P d be different. 

E 
object 
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Sharing Revisited: 
Private Copy-on-write (COW) Objects 


| Process 1 Physical | Process 2 ш Two processes 
virtual memory memory virtual memory : | 
Lu mapping a private 
"n di um copy-on-write 
CN Ts. 9 (COW) object. 
"E 7. } Private — ^ » Area flagged as 
PORT ps copy-on-write : 
Ne -*% f area private copy-on- 
ES d Ж 2 write 
S oe m PTEs in private 
ее. areas are flagged 
E и as read-only 
Private 


copy-on-write object 
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Sharing Revisited: 
Private Copy-on-write (COW) Objects 


Process 1 


virtual memory 


- 


Physical 
memory 


-7 
-7 
- 
-1 


- 
- 
--. 
-` 


Private 
copy-on-write object 


Process 2 
virtual memory 


^ 
“ 
Ы 
< 
> . 
pCopy-on-write 
- BEN 
: 
n idi M MEN 
-7 ^ М 
“М - 
ШЕ Я 


Write to private 
copy-on-write 
page 


Carnegie Mellon 


Instruction writing 
to private page 
triggers 
protection fault. 
Handler creates 
new R/W page. 
Instruction 
restarts upon 
handler return. 


Copying deferred 
as long as 
possible! 


Carnegie Mellon 


The fork Function Revisited 


m VM and memory mapping explain how fork provides private 
address space for each process. 


m Tocreate virtual address for new new process 


" Create exact copies of current mm struct,vm area struct, апа 
page tables. 


" Flag each page in both processes as read-only 
= Нас еасһ vm area struct in both processes as private COW 


m Onreturn, each process has exact copy of virtual memory 


m Subsequent writes create new pages using COW mechanism. 
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The execve Function Revisited 


изн | втш» To load and run a new 
program a. out in the 
current process using 
execve: 
m Free vm area struct's 


Memory mapped region 
for shared libraries 


Uninitialized data (.bss) 


Shared, file-backed NE 
| and page tables for old areas 


а Create vm area struct's 


and page tables for new 
Runtime heap (via malloc) | Private, demand-zero areas 
" Programs and initialized data 
| Private, demand-zero backed by object files. 
= .bss and stack backed by 
Initialized data (.data) anonymous files . 
Private, file-backed 
Program text (.text) 


m SetPCtoentry point 
іп . text 


= Linux will fault in code and 
data pages as needed. 
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User-Level Memory Mapping 


void *mmap(void *start, int len, 
int prot, int flags, int fd, int offset) 


ш Map len bytes starting at offset offset ofthe file specified 
by file description £d, preferably at address start 
" start: тау be 0 for "pick an address" 
= prot:PROT READ, PROT WRITE, ... 
" flags: MAP ANON, MAP PRIVATE, MAP SHARED, ... 


m Return a pointer to start of mapped area (may not be start) 
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User-Level Memory Mapping 


void *mmap(void *start, int len, 
int prot, int flags, int fd, int offset) 


P len bytes 


m" start 
P d (or address 
len bytes 4 |] o o O e chosen by kernel) 
offset ———» EE — 
(bytes) 
0 0 
Disk file specified by Process virtual memory 
file descriptor £d 


Using mmap to Copy Files 


m Copying without transferring data to user space . 


#include "csapp.h" /* mmapcopy driver */ 
int main(int argc, char **argv) 
Ы { 
* mmapcopy - uses mmap to сору struct stat stat; 
* file fd to stdout int fd; 


37 
void mmapcopy(int fd, int size) /* Check for required cmdline arg */ 
{ if (arge != 2) { 
printf("usage: $s <filename>\n”, 
/* Ptr to mem-mapped VM area */ argv[0]); 


char *bufp; exit(0); 
} 
bufp = Mmap(NULL, size, 
PROT READ, /* Copy the input arg to stdout */ 
MAP PRIVATE, fd, 0); fd = Open(argv[1], О RDONLY, 0); 
Write(1, bufp, size); Fstat(fd, &stat); 
return; mmapcopy (Ға, stat.st size); 
exit(0); 
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“> Memory-Mapped Files 


m Memory-mapped file ГО allows file I/O to be treated as routine 
memory access by mapping a disk block to a page in memory 


m Afileis initially read using demand paging 


e A page-sized portion of the file is read from the file system into 
a physical page 


e Subsequent reads/writes to/from the file are treated as 
ordinary memory accesses 


m Simplifies and speeds file access by driving Пе ИО through 
memory rather than read() and write() system calls 


ш Also allows several processes to map the same file allowing the 
pages in memory to be shared 


m But when does written data make it to disk? 
e Periodically and / or at file cLose( ) time 


e For example, when the pager scans for dirty pages 


ad 


Operating System Concepts - 9'^ Edition 9.57 Silberschatz, Galvin and Gagne €2013 
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Memory-Mapped File Technique for all I/O 


Some OSes uses memory mapped files for standard I/O 


Process can explicitly request memory mapping а file via mmap( ) 
system call 


e Now file mapped into process address space 
For standard I/O (open(), read(), write(), close()), mmap 
anyway 
e But map file into kernel address space 
e Process still does read() and write() 
» Copies data to and from kernel space and user space 
e Uses efficient memory management subsystem 
» Avoids needing separate subsystem 
COW can be used for read/write non-shared pages 
Memory mapped files can be used for shared memory (although 


again via separate system calls) 
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Operating System Software 


The design of the memory management 


portion of an operating system depends on 
three fundamental areas of choice: 


* whether or not to use virtual memory techniques 
* the use of paging or segmentation or both 


• the algorithms employed for various aspects of 
memory management 


Policies for Virtual Memory 


| 
ш Key issue: Pr О 


" minimize page faults 


Fetch Policy 
Demand paging 
Prepaging 


Placement Policy 


Replacement Policy 


Basic Algorithms 
Optimal 
Least recently used (LRU) 
First-in-first-out (FIFO) 
Clock 

Page Buffering 


Resident Set Management 
Resident set size 
Fixed 
Variable 
Replacement Scope 
Global 
Local 


Cleaning Policy 
Demand 
Precleaning 


Load Control 


m Determines when a 
page should be Two main 


brought into types: 
memory 


Demand 


Paging Prepaging 


m Demand Paging 


m only brings pages into main memory when a reference is made 
to a location on the page 

m many page faults when process 1$ first started 

m principle of locality suggests that as more and more pages are 
brought in, most future references will be to pages that have 
recently been brought in, and page faults should drop to a very 
low level 


m Prepaging 


pages other than the one demanded by a page fault are brought 
in 


exploits the characteristics of most secondary memory devices 


if pages of a process are stored contiguously in secondary 
memory it is more efficient to bring in a number of pages at 
one time 


ineffective if extra pages are not referenced 
should not be confused with "swapping" 


Non-Uniform Memory Access 


So far all memory accessed equally 
Many systems аге NUMA — speed of access to memory varies 


e Consider system boards containing CPUs and memory, 
interconnected over a system bus 


ш Optimal performance comes from allocating memory "close to" 
the CPU on which the thread is scheduled 


e And modifying the scheduler to schedule the thread on the 
same system board when possible 


e Solved by Solaris by creating lgroups 
» Structure to track CPU / Memory low latency groups 
» Used my schedule and pager 


› When possible schedule all threads of a process and 
allocate all memory for that process within the Igroup 


Operating System Concepts - 9th Edition 9.48 Silberschatz, Galvin and Gagne €2013 
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ш Determines where in real memory a process 
piece 1s to reside 


ш [Important design issue 1n a segmentation system 


ш Paging or combined paging with segmentation 
placing 1s irrelevant because hardware performs 
functions with equal efficiency 


и For NUMA systems an automatic placement 
strategy 1s desirable 


ш Deals with the selection of a page in main memory 
to be replaced when a new page must be brought 1n 


ш objective is that the page that is removed be the page 
least likely to be referenced іп the near future 


m The more elaborate the replacement policy the 
greater the hardware and software overhead to 
implement it 


= When a frame is locked the page currently stored in that frame 
may not be replaced 


= kernel of the OS as well as key control structures are held 
in locked frames 


" [/O buffers and time-critical areas may be locked into 
main memory frames 


= locking is achieved by associating a lock bit with each 
frame 


Basic Algorithms 


Algorithms used for 
the selection of a 


90019011101 page to replace: 
C. 


* Optimal 

* Least recently used (LRU) 
* First-in-first-out (FIFO) 

e Clock 


" Selects the page for which the time to the 
next reference is the longest 

" Produces three page faults after the frame 
allocation has been filled 


Page address 


stream 1 3 2 | 5 2 4 5 3 2 5 : 
OFT | | [3] [3x] 
КЕ} Ей = mp Ба Бәс EN 
F F F 


F= page fault occurring after the Frame allocation is initially filled 


Figure 8.15 Behavior of Four Page Replacement Algorithms 


m Replaces the page that has not been referenced for the longest 
time 


m By the principle of locality, this should be the page least likely 
to be referenced in the near future 


m Difficult to implement 


m one approach is to tag each page with the time of last 
reference 


m this requires a great deal of overhead dr 


LRU Example 
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stream 


. EE mx Ex Ex Бу ро Е 
LRU | E EX ЕКІ D [ENS 
E F 


F= page fault occurring after the frame allocation is initially filled 


Ligure 8.15 Behavior of Four Page Replacement Algorithms 


m Treats page frames allocated to a process as a circular buffer 


m Pages are removed in round-robin style 
= simple replacement policy to implement 


m Page that has been in memory the longest is replaced 


0100 
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FIFO Example 


Page address 
stream 


Е- page fault occurring after the frame allocation is initially filled 


Figure 5.15. Behavior of Four Page Replacement Algorithms 


m Requires the association of an additional bit with each frame 


m referred to as the use bit 


m When a page is first loaded in memory or referenced, the use bit 
is set to 1 


m The set of frames 15 considered to be a circular buffer 


m Any frame with a use bit of 1 1s passed over by the algorithm 


m Page frames visualized as laid out in a circle е D | 


Clock Policy Example 


Page address 
stream 


F= page fault occurring after the frame allocation is initially filled 


Figure 4.15 Behavior of Four Page Replacement Algorithms 
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(a) State of buffer just prior to a page replacement 


6 5 
(b) State of buffer just after the next page replacement 


Figure 8.16 Example of Clock Policy Operation 


Comparison of Algorithms 
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Figure 8.17 Comparison of Fixed-Allocation, Local Page Replacement Algorithms 
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Figure 8.18 The Clock Page-Replacement Algorithm [GOLD89] 


ombined Examples 
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F= page fault occurring after the frame allocation is initially filled 


Figure 8.18 Behavior of Four Page Replacement Algorithms 


и Improves paging A replaced page 15 
f d not lost, but 
perrormance an rather assigned to 


allows the use of one of two lists: 


a simpler page 
replacement 
policy 


Free page list Modified page list 


list of page frames 
available for 
reading in pages 


pages are written 
out in clusters 


m With large caches, replacement of pages can have a performance 
impact 


m if the page frame selected for replacement 1$ in the cache, that 
cache block is lost as well as the page that 1t holds 

m in systems using page buffering, cache performance can be 
improved with a policy for page placement in the page buffer 


m most operating systems place pages by selecting an arbitrary 


page frame from the page buffer 


m The OS must decide how many pages to bring into main memory 


m the smaller the amount of memory allocated to each process, 
the more processes can reside in memory 


m small number of pages loaded increases page faults 


m beyond a certain size, further allocations of pages will not 
effect the page fault rate 


Fixed-allocation 


gives a process a fixed 
number of frames in main 
memory within which to 
execute 


m when a page fault occurs, 
one of the pages of that 
process must be replaced 


Variable-allocation 


m allows the number of page 
frames allocated to a 
process to be varied over 
the lifetime of the process 


m The scope of a replacement strategy can be categorized as 
global от local 


m both types are activated by a page fault when there are no free 
page frames 


Local 


* chooses only among the resident pages of the process that generated 
the page fault 


Global 


* considers all unlocked pages in main memory 


Resident Set 
Management Summary 


Local Replacement Global Replacement 


Fixed Allocation «Number of frames allocated | *Not possible. 
to a process is fixed. 


*Page to be replaced is chosen 
from among the frames 


allocated to that process. 


Variable Allocation *The number of frames *Page to be replaced is chosen 


allocated to a process may be| from all available frames in 
changed from time to time to | main memory; this causes 
maintain the working set of the size of the resident set of 
the process. processes to vary. 


*Page to be replaced is chosen 
from among the frames 
allocated to that process. 


m Necessary to decide ahead of time the amount of 
allocation to give a process 


m If allocation 1s too small, there will be a high page fault 
rate 


НЫКЫ En © increased processor idle time 
large, there will be ; и 
• increased time spent in 


too few programs 
in main memory swapping 


m Easiest to implement 


m adopted in a number of operating systems 
m OS maintains a list of free frames 


m Free frame 15 added to resident set of process when a page fault 
occurs 


m If no frames are available the OS must choose a page currently in 
memory 


m One way to counter potential problems is to use page buffering 


m When а new process is loaded into main memory, allocate to it a 
certain number of page frames as its resident set 


m When a page fault occurs, select the page to replace from among 
the resident set of the process that suffers the fault 


m Reevaluate the allocation provided to the process and increase or 
decrease it to improve overall performance 


m Decision to increase or decrease a resident set size 1s based 
on the assessment of the likely future demands of active 
processes 


Key elements: 


* criteria used to determine 


resident set size 
* the timing of changes 


Sequence of 


Page Window Size, A 
References 
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sm | esa | - |. | Working Set 
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Working-Set Model 


ш A=working-set window = a fixed number of page references 
Example: 10,000 instructions 


m WSS, (working set of Process Р) = 
total number of pages referenced in the most recent A (varies in time) 


e if Atoo small will not encompass entire locality 
e if A too large will encompass several localities 
е if A= о > will encompass entire program 

m D=} WSS;= total demand frames 
e Approximation of locality 

m ifD>m= Thrashing 


ш Policy if D > m, then suspend or swap out one of the processes 


page reference table 
...2615777751623412344434344413234443444... 


WS(t,) = {1,2,5,6,7} WS(t,) = {3,4} 


Operating System Concepts - 9% Edition 9.53 Silberschatz, Galvin and Gagne €2013 


» Keeping Track of the Working Set 


ш Approximate with interval timer + a reference bit 
m Example: A = 10,000 
e Timer interrupts after every 5000 time units 
e Keep in memory 2 bits for each page 


e Whenever a timer interrupts copy and sets the values of all 
reference bits to O 


e If one of the bits in memory = 1 = page in working set 


Why is this not completely accurate? 
Improvement = 10 bits and interrupt every 1000 time units 


Operating System Concepts - 9'^ Edition 9.54 Silberschatz, Galvin and Gagne ©2013 


m Requires a use bit to be associated with each page 1n memory 


m Bit is set to 1 when that page is accessed 


m When а page fault occurs, the OS notes the virtual time since the 
last page fault for that process 


m Does not perform well during the transient periods when there is 
a shift to a new locality 


“> Page-Fault Frequency 


m More direct approach than WSS 


m Establish "acceptable" page-fault frequency (PFF) rate 
and use local replacement policy 


e |f actual rate too low, process loses frame 
e If actual rate too high, process gains frame 


2 increase number 
— of frames 
m 
5 upper bound 
Ф 
D 
кој 
О. 

lower bound 

decrease number 


of frames 


number of frames 


4 4 
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» Working Sets and Page Fault Rates 


m Direct relationship between working set of a process and its 
page-fault rate 


Working set changes over time 
Peaks and valleys over time 


working set 


time 


AÀ 


Operating System Concepts - 9'^ Edition 9.56 Silberschatz, Galvin and Gagne €2013 


Variable-interval Sampled 
Working Set (V$8WS). - 


m Evaluates the working set of a process at sampling instances based 
on elapsed virtual time 


m Driven by three parameters: 


the number of 
page faults that 
are allowed to 


the minimum the maximum 
duration of the duration of the 


occur between 
sampling 
Instances 


sampling sampling 
interval interval 


"Cleaning Policy ~ 


m Concerned with determining when a modified page should be 
written out to secondary memory 


Demand Cleaning 


a page is written out to secondary memory only when it has been selected for 
replacement 


Precleaning 


allows the writing of pages in batches 


m Determines the number of processes that will be resident in main 
memory 


m multiprogramming level 
m Critical in effective memory management 


m Too few processes, many occasions when all processes will be 
blocked and much time will be spent in swapping 


m Too many processes will lead to thrashing 


Multiprogramming | 
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Multiprogramming Level 


Figure 8.21 Multiprogramming Effects АА 


Process Suspension 


m If the degree of multiprogramming is to be reduced, one or more 
of the currently resident processes must be swapped out 


Six possibilities exist: 


* Lowest-priority process 

* Faulting process 

* Last process activated 

* Process with the smallest resident set 

* Largest process 

* Process with the largest remaining execution window 


m Intended to be machine independent so its memory 
management schemes will vary 
m early Unix: variable partitioning with no virtual memory 
scheme 


SVR4 and Solaris use 
two separate schemes: 


злато make use of 


• paging system 


* kernel memory allocator 


provides a virtual memory 
capability that allocates page frames 
in main memory to processes 


allocates page frames to disk block 
buffers 


Kernel Memory 
Allocator 


allocates memory for the kernel 


UNIX вува 


Memory (a) Page table entry 
| 


Management 


Formats (b) Disk block descriptor 


_ | Reference | Logical Block Pfdnta 


іс) Page frame data table entry 


Reference Page/starage 
count unit mum ber 


(d) Swap-use table entry 


Figure 8.22 UNIX ЧУКА Memory Management Formats 


Table 8.6 


UNIX SVR4 
Memory 
Management 
Parameters 
(page 1 of 2) 


Page Table Entry 


Page frame number 
Refers to frame in real memory. 


Age 
Indicates how long the page has been in memory without being referenced. The length and 
contents of this field are processor dependent. 


Copy on write 
Set when more than one process shares a page. If one of the processes writes into the page, a 
separate copy of the page must first be made for all other processes that share the page. This 
feature allows the copy operation to be deferred until necessary and avoided in cases where it turns 
out not to be necessary. 


Modify 
Indicates page has been modified. 


Reference 
Indicates page has been referenced. This bit is set to О when the page is first loaded and may be 


periodically reset by the page replacement algorithm. 


Indicates page is in main memory. 


Indicates whether write operation is allowed. 


Disk Block Descriptor 


Swap device number 
Logical device number of the secondary device that holds the corresponding page. This allows 
more than one device to be used for swapping. 


Device block number 
Block location of page on swap device. 


Type of storage 
Storage may be swap unit or executable file. In the latter case, there is an indication as to whether 
the virtual memory to be allocated should be cleared first. 


Table 8.6 


UNIX SVR4 
Memory 
Management 
Parameters 


(page 2 of 2) 


Page Frame Data Table Entry 


Page state 
Indicates whether this frame is available or has an associated page. In the latter 
case, the status of the page is specified: on swap device, in executable file, or 
DMA in progress. 


Reference count 
Number of processes that reference the page. 


Logical device 
Logical device that contains a copy of the page. 


Block number 
Block location of the page copy on the logical device. 


Pfdata pointer 
Pointer to other pfdata table entries on a list of free pages and on a hash queue of 


pages. 
Swap-Use Table Entry 


Reference count 
Number of page table entries that point to a page on the swap device. 


Page/storage unit number 
Page identifier on storage unit. 


m The page frame data table is used for page replacement 


m Pointers are used to create lists within the table 
m all available frames are linked together 1n a list of free frames 
available for bringing іп pages 
m when the number of available frames drops below a certain 
threshold, the kernel will steal a number of frames to 
compensate 
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Figure 8.23 Two-Handed Clock Page-Replacement Algorithm 


m The kernel generates and destroys small tables and buffers 
frequently during the course of execution, each of which requires 
dynamic memory allocation. 


m Most of these blocks are significantly smaller than typical pages 
(therefore paging would be inefficient) 


m Allocations and free operations must be made as fast as possible 


m Technique adopted for SVR4 


m UNIX often exhibits steady-state behavior in kernel memory 
demand 


m i.e. the amount of demand for blocks of a particular size 
varies slowly in time 


m Defers coalescing until it seems likely that it is needed, and 
then coalesces as many blocks as possible 


Lazy Buddy System Algorithm 


| Initial value of Dj is 0 
After an operation, the value of Dj is updated as follows 


(D) if the next operation is a block allocate request: 
if there is any free block, select one to allocate 
if the selected block 15 locally free 
then Dj := Dj +2 
else Dj := Dj + 1 
otherwise 
first get two blocks by splitting a larger one into two (recursive operation) 
allocate one and mark the other locally free 
Dj remains unchanged (but D may change for other block sizes because of the 
recursive call) 


(II) if the next operation is a block free request 

Case Dj 2 2 
mark it locally free and free it locally 
Di := Di -2 

Case Dj = 1 
mark it globally free and free it globally: coalesce if possible 
Di E 

Case Dj; = 0 
mark it globally free and free it globally; coalesce if possible 
select one locally free block of size 2! and free it globally; coalesce if possible 


Dj:-0 


Figure 8.24 Lazy Buddy System Algorithm 


m Shares many characteristics with Unix 


m Is quite complex 


* process virtual 
memory 


КЛ * kernel memory 


m Three level page table structure: 


Page directory Page middle directory 


v 
may span multiple pages may also span multiple pages | 
v 


Address Translation 


Virtual address 


Global Directory | ^ Middle Directory Page Table 


Page table Ж 


Page middle en 
directory Page frame 
in physical 
memory 


Figure 8.25 Address Translation in Linux Virtual Memory Scheme 


m Based on the clock algorithm 


m The use bit is replaced with an 8-bit age variable 
m incremented each time the page 1s accessed 


m Periodically decrements the age bits 


m а page with an age of 0 is an “old” page that has not been 
referenced 1s some time and 1s the best candidate for 
replacement 


m A form of least frequently used policy 


Kernel Memory Allocatio 


m Kernel memory capability manages physical main memory page 


n 


frames 


m primary function is to allocate and deallocate frames for particular 


uses ' 
Possible owners of a frame include: 


* user-space processes 


* dynamically allocated kernel data 
* static kernel code 
* page cache 


m Abuddy algorithm 15 used so that memory for the kernel can be 


allocated and deallocated 1n units of one or more pages 


small short-term memory chunks in odd sizes 


m Slab allocation 


m used by Linux to accommodate small chunks 


Page allocator alone would be inefficient because the kernel requires 


m Virtual memory manager controls how memory is allocated and 
how paging is performed 


m Designed to operate over a variety of platforms 


m Uses page sizes ranging from 4 Kbytes to 64 Kbytes 


m On 32 bit platforms each user process sees a separate 32 bit 
address space allowing 4 Gbytes of virtual memory per process 


" by default half is reserved for the OS 


m Large memory intensive applications run more effectively using 
64-bit Windows 


m Most modern PCs use ће AMD64 processor architecture which 
is capable of running as either a 32-bit or 64-bit system 
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Figure 8.26 Windows Default 32-bit Virtual Address Space 


m On creation, a process can make use of the entire user space of 
almost 2 Gbytes 


m This space is divided into fixed-size pages managed in 
contiguous regions allocated on 64 Kbyte boundaries 


m Regions may be in one of three states: 


available reserved 


m Windows uses variable allocation, local scope 


m When activated, a process is assigned a data structure to manage 
its working set 


m Working sets of active processes are adjusted depending on the 
availability of main memory 


m Desirable to: 


m maintain as many processes in main memory as possible 


m free programmers from size restrictions in program 
development 


m With virtual memory: 


m all address references are logical references that are translated 
at run time to real addresses 


ш a process can be broken up into pieces 
m two approaches are paging and segmentation 


m management scheme requires both hardware and software 
support 


